Linguistic expressions as a tool to extract information
Abstract
When searching a corpus database for several words at once, we have all encountered the problem of not retrieving ‘the right thing’. Very often the results returned by the computer do not match the analysis a human reader would have produced. To understand a text correctly, we rely not only on the semantic content of its words but also on grammatical devices that provide reading instructions. These devices are our concern here. We examine French prepositional expressions (pour ce qui est de, en ce qui concerne, à propos de, au sujet de, etc.), which are syntactically either detached, generally in initial position, or dependent on another morphosyntactic constituent. This syntactic behaviour affects how information is processed and corresponds to two discourse functions. When detached, these prepositions introduce a thematic frame (Charolles 1997) and mark the author’s pragmatic intentions; in this role they can integrate one or more propositions. When they depend on another constituent, they simply focalise and lack that integrative property. The ultimate aim of this ongoing analysis is to engineer a tool that uses linguistic expressions to extract thematic information. The purpose of this paper is threefold: to detail how linguistic information about thematic introducers is captured; to show how these introducers help in segmenting texts; and to give a technical overview of the system currently being developed by the LaLIC team, which uses this linguistic information.
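The contrast between detached and dependent uses can be illustrated with a small heuristic. The sketch below is not the LaLIC system, whose design is not described here. It assumes, purely for illustration, that an introducer is "detached" when it opens the sentence and is closed by a comma, and "dependent" otherwise. The function name `classify`, the regular expression, and the sample sentences are all hypothetical.

```python
import re

# Contracted or elided forms of "de" that French requires before some nouns.
DE = r"(?:de\s+|du\s+|des\s+|d['’])"

# The four introducers named in the abstract; illustrative, not exhaustive.
INTRODUCER = re.compile(
    rf"\b(?:pour ce qui est\s+{DE}|en ce qui concerne\s+"
    rf"|à propos\s+{DE}|au sujet\s+{DE})",
    re.IGNORECASE,
)


def classify(sentence):
    """Return (introducer, role, topic) triples found in one sentence.

    Assumed heuristic: an introducer at the very start of the sentence,
    followed later by a comma, is treated as detached, and the text up to
    that comma is taken as the topic it introduces. Any other occurrence
    is treated as dependent on another constituent.
    """
    results = []
    stripped = sentence.strip()
    for match in INTRODUCER.finditer(stripped):
        form = match.group(0).strip().lower()
        rest = stripped[match.end():]
        if match.start() == 0 and "," in rest:
            topic = rest.split(",", 1)[0].strip()
            results.append((form, "detached", topic))
        else:
            results.append((form, "dependent", None))
    return results


if __name__ == "__main__":
    samples = [
        "En ce qui concerne le budget, le conseil a voté hier.",
        "Il a posé une question au sujet du budget.",
    ]
    for sample in samples:
        print(sample, classify(sample))
```

A real system would need sentence segmentation, a fuller inventory of introducers, and a way to decide how many following propositions a detached introducer integrates; none of that is attempted here.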
Publication date: 2001